News Article To Markdown

Pricing

from $50.00 / 1,000 results extracted

Developer

Extreme Scrapes

Maintained by Community

Extract news articles as clean, ad-free Markdown with automatic author and publish date detection. Strips navigation, ads, related articles, comments, and social sharing widgets.

Features

  • Clean extraction — removes nav, footer, ads, related articles, newsletters, comments, social widgets
  • Author detection — automatically extracts author name from article header
  • Date detection — automatically extracts publish date in various formats
  • Image captions — generates alt text for images lacking captions
  • Batch processing — extract multiple articles in a single run
  • Works with any news site — BBC, CNN, Reuters, NYT, TechCrunch, etc.

How It Works

  1. Provide news article URLs as input.
  2. The Actor fetches each article, stripping all non-content elements.
  3. Author and publish date are detected from the first 20 lines of each article, where the article header normally sits.
  4. Clean Markdown with metadata is stored in the Apify dataset.
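
The Actor's actual detection rules are not documented here, so the following is only a minimal sketch of how step 3 could work. It assumes a "By Name" byline and an ISO-style date (YYYY-MM-DD); the function name `detect_metadata` and both regular expressions are hypothetical, and real articles use many more date formats than this handles.

```python
import re
from typing import Dict, Optional

# Hypothetical patterns for illustration only; not the Actor's real rules.
# Byline: "By" followed by one or more capitalized name parts.
BYLINE_RE = re.compile(r"^\s*[Bb]y\s+([A-Z][\w.'-]*(?:\s+[A-Z][\w.'-]*)*)")
# Date: a bare ISO date such as 2024-01-15 anywhere in the line.
ISO_DATE_RE = re.compile(r"\b(\d{4}-\d{2}-\d{2})\b")


def detect_metadata(markdown: str, max_lines: int = 20) -> Dict[str, Optional[str]]:
    """Scan only the first `max_lines` lines for a byline and a date."""
    author: Optional[str] = None
    publish_date: Optional[str] = None

    for line in markdown.splitlines()[:max_lines]:
        if author is None:
            match = BYLINE_RE.match(line)
            if match:
                author = match.group(1).strip()
        if publish_date is None:
            match = ISO_DATE_RE.search(line)
            if match:
                publish_date = match.group(1)
        if author is not None and publish_date is not None:
            break  # both fields found, no need to read further

    return {"author": author, "publishDate": publish_date}
```

Limiting the scan to the top of the article keeps names and dates mentioned in the body text from being mistaken for metadata, at the cost of missing bylines that a site prints at the end.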

Input

{
  "startUrls": [
    { "url": "https://www.bbc.com/news/technology-67988517" },
    { "url": "https://techcrunch.com/2024/01/15/some-article" }
  ]
}
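
For batch runs it can help to build this input from a plain list of URLs. The sketch below uses only the Python standard library; the helper name `build_input` is hypothetical, and the URL validation and de-duplication are assumptions about what is sensible to send, not documented requirements of the Actor.

```python
from typing import Dict, Iterable, List
from urllib.parse import urlparse


def build_input(urls: Iterable[str]) -> Dict[str, List[Dict[str, str]]]:
    """Turn a list of article URLs into the `startUrls` input shape."""
    start_urls: List[Dict[str, str]] = []
    seen = set()

    for raw in urls:
        url = raw.strip()
        parsed = urlparse(url)
        # Reject anything that is not an absolute http(s) URL.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"not an absolute http(s) URL: {raw!r}")
        if url in seen:
            continue  # skip exact duplicates, keeping the first occurrence
        seen.add(url)
        start_urls.append({"url": url})

    return {"startUrls": start_urls}
```

The resulting dictionary can be passed as `run_input` when calling the Actor through the `apify-client` Python package; the Actor ID is not given on this page, so it is left out here.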

Output

{
  "url": "https://www.bbc.com/news/technology-67988517",
  "author": "Jane Smith",
  "publishDate": "2024-01-15",
  "markdown": "# Article Title\n\nArticle content..."
}
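
One way to consume a record like this is to write it out as a Markdown document with a metadata header. The sketch below is an illustration rather than part of the Actor: the function name `to_front_matter` is hypothetical, and it assumes that `author` or `publishDate` may be missing or null when detection fails, which this page does not state.

```python
import json
from typing import Any, Dict


def to_front_matter(item: Dict[str, Any]) -> str:
    """Render one dataset item as Markdown with a YAML front matter header."""
    lines = ["---"]
    for key in ("url", "author", "publishDate"):
        value = item.get(key)
        if value is not None:
            # json.dumps yields a double-quoted string, which is valid YAML.
            lines.append(f"{key}: {json.dumps(value)}")
    lines.append("---")
    return "\n".join(lines) + "\n\n" + item.get("markdown", "")


item = {
    "url": "https://www.bbc.com/news/technology-67988517",
    "author": "Jane Smith",
    "publishDate": "2024-01-15",
    "markdown": "# Article Title\n\nArticle content...",
}
document = to_front_matter(item)
```

Keeping the metadata in front matter leaves the `markdown` body untouched, so it can still be fed as-is into a summarization or indexing pipeline.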

Use Cases

  • Media monitoring and news aggregation
  • Build news datasets for AI training
  • Track coverage of specific topics
  • Feed news into summarization pipelines

Keywords

news scraper, article extractor, news to markdown, media monitoring, news parser, article scraper

Pricing

$50 per 1,000 article extractions.
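
At this rate each article costs $0.05. As a rough budgeting aid, the sketch below estimates the extraction charge for a batch; the function name `estimate_cost` is hypothetical, and it assumes a flat $50 per 1,000 rate with every article billed, ignoring any platform usage fees or plan-specific pricing.

```python
from decimal import Decimal

# Flat rate quoted above; actual billing may differ by plan (assumption).
PRICE_PER_1000 = Decimal("50.00")


def estimate_cost(article_count: int) -> Decimal:
    """Estimated extraction charge in USD, rounded to cents."""
    if article_count < 0:
        raise ValueError("article_count must be non-negative")
    cost = PRICE_PER_1000 * article_count / 1000
    return cost.quantize(Decimal("0.01"))
```

For example, `estimate_cost(250)` comes to 12.50, so a daily run over 250 articles would cost about $12.50 in extraction charges.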